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IMAGE COMPRESSION 



CROSS-REFERENCE TO RELATED APPLICATIONS 

The following copending provisional application disclose related subject 
matter serial no. (docket 26414), filed 7/18/97 . 



BACKGROUND OF THE INVENTION 

The invention relates to electric image methods and devices, and, more 
particularly, to dtftf communication and storage systems with compressed irrages. 

Video communication (television, teleconferencing, Internet, and so forth) 
typically transmits a stream of video frames (pictures, images) along with audio over 
a transmission channel for real time viewing and listening or storage. However 
.ransmissfcn channels frequently add corrupting noise and have iimKed bandwrdth. 
Consequently, digital video transmission rth compression enjoys widespread use. 
, n particular, various standards for compression of digita. video have emerged and 
include H.261 , MPEG-1 , and MPEG-2. with more to follow, including in development 
H 263 and MPEG-4. There are similar audio compression methods. 

Tekalp Digital Video Processing (Prentice Hall 1995), Clarke, Digital 
Compression of Stil, Images and Video (Academic Press 1995), and Schafer e. al. 
Digital Video Coding Standards and Their Role in Video Communications, 83 Proc. 
IEEE 907 (1995), include summaries of various compression methods, includmg 
descriptions of the H.261, MPEG-1, and MPEG-2 standards plus the H.263 
recommendations and indications of the desired functionalities of MPEG-4. These 
references and all other references cited are hereby incorporated by reference. 

H 261 compression uses interframe prediction to reduce temporal 
redundancy and discrete cosine transform (DCT) on a block level together wi* high 
spatia, frequency cutoff to reduce spatial redundancy. H.261 is recommendec I for 
use with transmission rates in multiples of 64 Kbps (kilobits per second) to 2 Mbps 
(megabits per second). 
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The H 263 recommendation is analogous to H.261 but for bi.ra.es of about 
22 Kbps (twisted pair telephone wire oompatibie) and wtth motion elation at 
7* L accuracy (which eliminates the need for loop filtenng ava.lable ,n H.261 ) 

motion compensation with 16 by 16 macroblock and 8 by 8 blocks^ 

MPEG-landMP^alsousetemporalpredictionfollowedbytwo 

dimensional OCT transform^ on a block leve, as H.261 , but they make furth 

TLriouscombinationso™^ 
Irrtraframe coding. MPEG-1 aims at video CDs and works well at rates about 1-1 .5 
22 frames of about 360 pixels by 240 lines and 24-30 frames per second. 

^.compensated b W irectk,na, prediction/interpolation from ad )a cent I and P 

,rameS M P EG -2 aims at digital television ,720 pixeis by 480 lines) and uses grates 
uo to about 10 Mbps with MPEG-1 type motion compensation with I, P. and B 

reS °' U "?the foregoing MPBG compression methods result in a number o, 
unacceptable artifacts such as blockiness and unnatural object motion when 
oTeld a, very-low-bK-rates. Because these technics use only the statical 

0 L video stream, artifacts are introduced a, the block boundanes un er 
a ,o nhvsical boundaries of the moving objects and hence v,sually 

the frame rate to fall below that required for smooth moton. 

MPEG-4 is to apply to transmission bitrates of 10 Kbps to 1 Mbps and ,s to 
use a content-based coding approach wfth fcnctionalities such as scalab,l,ty. 
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operafing across MM" channels ™ v|de0 ^ has 

missing ob]ects. MPEG, Shapiro, 

For encoding a single frame as ,n JPEG or it 

Tr.Sig.Proc 3445 (1993) provides aw coefficients 

bi,rate Viiiasenor e, a,, Waveiet Fiiter Evaluation ,or image Compression 4 iEEE 

various mother waveless. nroWem 

Hardware and software implementat.ons of the JPEG, H.2B1 , m 
MP impression and aecoding e X is, Purthe, pr^arr — processors 
or digital signal processors, such as the Ultrasparc or TMS320C6*. runnmg 
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plsso. may hand,e ,ower citrate compression and decompress.0, 

SUMMARY OF THE INVENTION 

The present invention provides image compression for simpie ,mages w*h 
bi ,p,ane co text modeling and for images containing both simpie and nature ,mage 

ze rotree coding using a tiling of the image to segment the portions. 

This has the advantage of decreasing the bb required for cod,ng v, m> 
ocnpensa.n retired in a decoder and can he used for JPEG or M PEG , frames 

or objects. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The drawings are schematic for clarity. 
Figure 1 is a flow diagram for a preferred embodiment encod.ng. 
Figures 2a-c illustrates subband hierarchical decompos.t.on. 
Figures 3a-d show coefficient encoding- 
Figures 4a-b show flow for a dominant pass and the states. 
Figures 5a-c indicate empirical results. 
Figure 6 illustrates the combination preferred embodiment. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 
expla.ned w* the help o an<j four ^ ^ „ 

ea c h o 1 Lf,,^^^ 

b t 7 6 Altered frames (boundary pixel values are used to extend the frame f r 

.oie atk=0 1 and is zero for all other k.niwequd.s 

riallly convoiution with the functions, so h 0 Is a lowpass filter ,n one 
Conine, lowpass-lowpass, lowpass-bighpass, Wg ,pas,,owpass. and 

88 subarrays of wavelet coefficients, denoted LL1 , LH1 , HL1 , and HH1, 
Iplely withcoefficientlocationsOWrelabeiledfor and 0,^7. 

the first level wavelet coefficients. 
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The LL1 LH1, HL1, and HH1 subarrays can be used to reconstruct the 
original frame by firs, interpolating each subarray by a factor of four (to restore 
Z by 1T6 size), then filtering the four 144 by 176 arrays w«h fibers goGteoW 
godtolCO. OlfflgoW. gi(i)91«. respectively, and ,as«y pM. add,ng these 

fflters respectively, and relate to ho and hi by go(n) - (-1) hl(n) and 91 n)= 

IL 0 las wouid be the case for an odd tap filter, so after re— on the p«e, 
index is shifted by 1 to adjust for the two 1/2 pixel shifts during two filtenngs. 

The second level in the decomposition simply repeats the four fiKenngs w,th 
, he ho and hi functus plus subsampling by a factor o, four but using the LL1 
subarray as the input. Thus the four filtered subarrays are each 36 by 44 and 
d" L l U* «. ^ HH, As before, the U, LH2, HL2, and HH2 can be 
arranged to visualize the decomposition o, LL1 and also <^»»«« 

reconstruction o,LL1 with the go and gi based filters. The LH1, HL1, and HH1 

subarrays of first level coefficients remain unfiltered. 

Repeat this decomposite on U2 by filtering wtth the four filters based on h 0 

and hi followed by subsampling to obtain LL3, LH3, HL3, and HH3 which are 18 by 

a •„ 1 1 1 I hi 3 and HH3 can be arranged to 
22 coefficient subarrays. Again, LL3, LH3, HLo, ana nn 

visualize the decomposition of LL2. 

Compiete the hierarchical four level decomposition ofthe ong.nal frame by a 
,ast filtering with the four fiiters based on ho and M followed by subsampling o, 
*o obtain LL4, LH4, HL4, and HH4 which are each a 9 rows of 1 1 coefficients 
subarray. Figure 2c illustrates al, of the resulting subarrays arranged to form an 

ooefflcients in various levels ofthe decompose indeed, a coefficent w(,k) ,n LH4 
is the result of filtering and subsampling of coefficients xfl.k) ,n LL3. 
w(j,k) = ho(0)hl(0)x(2j,2k) + ho(0)hl(1)x(2j,2k-l) 
h 0 (0)hl(-1)x(2j,2k+1) + ho(0)hl(2)x(2j,2k-2) 
ho(0)hl(-2)x(2j,2k+2) + ho(0)h1(3)x(2j,2k-3) 
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h 0 (1)hl(0)x(2j- 1.2k) + h 0 (1)hl(1)x(2j- 1,2k- 1) 
ho(1)hl(-1)x(2H.2k+1> + hO(Dhl(2)x(2j-1,2k-2) 
h 0 (1)hl(-2)x(2j-1,2k+2) + h 0 (1)hl(3)x(2j-1,2k-3) 
Because the filtering plus subsampling is basically computing w(j,k) from the 
2 by 2 area in LL3 (the values of hi(k) are small except for k=0.1 ), there are four 
coefficients (x(2i-1, 2k-1). x(2j-1.2k), x(2i,2k-1) and x(2i,2k)) in LL3 which determine 
w(j.k)inLH4. Now each of these four coefficients in LL3 is related to a 
corresponding one of the four coefficients in LH3 at the same positions ((2,- 1 2k-1), 
(2i-1 2k) (2j.2k-1 ) and (2j,2k)) because they were computed from essentially the 
He 4 by 4 location in LL2. Thus the coefficient w(i,k) in LH4 is called the parent of 
the four related coefficients z(2j-1 ,2k-1), z<2j-1,2k), z(2j,2M) and z(2j.2k) in LH3 
and each of these four coefficients in LH3 is a child of the parent coefficient ,n LH4. 
This terminology extends to LH2 and LH1 . The generic term descendant includes 
children, children of children, and so forth. See Figure 2c showing descendants. 

Using the hierarchical decomposition of the original frame into coefflcent 
subarrays LL4, LH4, HH1, begin the modified embedded zerotree encoding: first 
find the maximum of the magnitudes of the coefficients wfl.k, in the coefficient array 
(union of subarrays LL4, LH4, HL4, ... LH1. HL1, HH1) except exclude the 
coefficients in LL4 which will be separately encoded. Then pick an inrt,al 

quantization threshold, To, so that 

Tosmax|w(j,k)|<2To 
The initial threshold value. To, is encoded (with a variable length code) and 

made part of the bitstream. 

Embedded zerotree encoding essentially encodes w(j,k> by using a bmary 
expansion of wO.kXTo with successive scans of the coefficient array adding 
successive bits in the expansion. This provides a fully embedded bitstream w,th 
resolution increasing over an entire reconstructed image on each scan of the array. 
For example, a background transmitted over the Internet could be improved w,th 
updates between data transmissions. Scan the wavelet coefficient array ,n the 
order of lowpass to highpass; that is, the subarrays are scanned in the order of 
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, H4 HL4 HL1 HH1 with each subarray raster scanned (a row at a time). Thus 
location in the coefficient array by order in the bitstream. 

coefficients and simple zerotree coding does not have any ga,n over PCM ndeed, 
S codes each of the 99 (9 by 1 1) coef^ents individually, and success- b* 
planes (one bi, from each of the 99 coefficient.) provide success,ve.y better 

Thus PCM can be part o, a lully embedded bitstream syntax. DPCM 

clients butthisdisruptsfuilembedding. U4 is basically a low resolut,on 

the ptels in a t6 by , ^^^^Z PCM 
of the bitstream is practical, figure 1 shows the case 

1 full embedding, the looping with threshold decrementing would ,ndude the code 

b ^t d ::l scan subam, L H4 and encode each of the 99 — 

lotree root with zero value) » ,w«,K), < To and a„ descendants of wO* (4 i 

root with significant value) if |w(j,k)| £ To but all descendants of w(j,k) ( 4 ,n ^3, 1 
7 H2 and 64 in LH1) have magnitudes less than To, « IZ (isolated zero) ,f M,K)| 

coefficients in LH4 are raster scanned, and the encoding generates ^ » each 
coefficient takes two Ms or, preferably, generates fewer b*s ,f an adaphve 

ZTRZ or ZTRS, men the decoder determines that al, of the descendantwav let 
coefficients have magnitude less than T 0 and so these coefficients need no be 
coded in this scan o, the coefficient array because they are already Known to be 
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insignificant Ffcure 3a illumes the range of coefficient va.ues and codes 

,o the list if w(i.k) is coded as ,S plus replace it wKh symbo, X ,n the LH4 subarray. 

T dPd as ZTRZ or IZ may be further resolved as ZTRS or IS and 

:::r r e : ^ sy mb o,s or ,, 0 „ e * « 

' S a ' S0 devaluates as a 0 for threshold comparisons (i.e., as in checking to see 

L a removed significant coefficient implies fewer coefficients may be needed ,o 
IZLubsiuentscansofthearra, There Is a tradeoff be^een ( 1)us^ the 
Col as the replacement for significant coefficients which codes fewer b,,s 
sILg the X symbols on successive scans and (2) using a 0 as the replacement 
X-t^cientswhichwillsavebHswhentheOisfoundtobeazerotree 

HL4 and then of HH4 along w«h the appending to the lis, o, significant coefficent 
nlus reDlacing by X of coefficients coded as ZTRS or IS. 

and encoding forthe third level subarrays LH3, HL3 and «^^^ 
subarray a wavelet coefficient which has a parent coded as »r ZTRS^ust 
skip ped;the decoder knows the locations of ^^^'^ 
Similarly, scan and encode the remaining subanays ,n order LH2, HL . HH2, 
LH1 HL1 and HH1 along with the appending to the significant coefficent fist and 
replacement by symbo, X for the coefficients coded as IS. Figure 4a is a flow 
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diagram o* the entire array scan where the Increment scan variable" means raster 
scan of the subarrays in order. The array scan and coding is termed a dom,nant 

nass throuqh the array. 

The decoder may recent a frame from the coefficient codes by us,nga 

values of ± 3T 0 ,2 for coefficient coded ZTRS or ,S and a vaiue of 0 for coefficients 
coded ZTRZ or IZ. This encoding essentiaiiy is a map of the location (and s^n) of 

significant coefficients (greater than threshold). 

Next,encodethemembersofthelistofsigniflcant«>efficients',na 

subordinate pass in which each member has one more bit coded as follows: w ,k) 
was previously coded as significant and positive (which means To , w 0 k) < 2T0 . 

w0k) coded as significant and negate codeaOfor-2To<w«.K)^To/2anda1 
,or- 3 T0/2<w 0 ,k) S -TO- Note these are just the second most signflcant b„s ,n 
the binary expansion of w(J,k)/To; ZTRS, IS, and a sign bit would be the sign and 
1, significant » (wKh negative numbers in two's complement format). F,gure 3b 

heuristically illustrates the range of codes. 

After completing the foregoing scan and ZTRZ-ZTRS-IZ-IS-skip cod.ng (a 
dominant pass through the coefficient array) plus the additional bit for the members 
of the significant coefficient list (subordinate pass), replace T 0 by T, = To/2 and 
re pea. the dominant pass and subordinate pass with T, as the threshold. F ,gure 3c 
iLra.es the coefficient ranges for the dominant pass, and Figure 3d illustrates « e 

„, significant coefficients plus replaces them with Xs, and the subord,nate pass adds 
an addMonal bit of resolufion for each members of the list. During the dominant 
pass the X value of a coefficient in the array is treated as a 0 for threshold 
comparisons but is skipped rather than being encoded a ZTRZ or XL. On the 
average this use of X decreases the number of bits that need to be transmuted; see 
Fkjur es 5a-c illustrating experimental result of the gain using X in connection w,th 
the preferred embodiment. 
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This successive decreases in the quantization threshold provides increasingly 
higher resolution recensions of the original frame. Further, if the initial threshold, 
To, may be selected for computational convenience provided in lies the range of 
half the coefficient maximum to the coefficient maximum. 

In the original embedded zerotree algoriths of Shapiro, the wavelet 
coefficients are coded in several kpasses. Each pass encodes one The 
positions of the coefficients that become significant with respect to the new 
threshold are encoded efficiently Wrth zerotrees, in which each node of the tree 
represents the significance of the coefficient a, the node and the coefficients in the 
substree rooted at the current node (one could consider a zerotree as essenfially a 
significance map). The original embedded zerotree algorithm use the fol,ow,ng 
symbols to represnt the significance of the nodes: ZTR, IZ, POS, NEG. ZTR 
represents a node where the coefficieent itself is zero and all its descendants are 
zero IZ represents a node where the coeficient itself is zero and not all oof ,ts 
descendants are zero, POS represents a node where the coeficient itself is posifive, 
and NEG represnts a node where the coefficient itself is negative. It can be shown 
that statistically the sero wavelet coeficients tend to cluster in the same spat,al 
nation, and the conditional probability for zero coefficient is much higher given the 
parent of the coefficient being zero. This explains why the zerotree quantized 
reduces the overhead for the significance map and provides good coding efficiency. 

The preferred embodiment improves coding efficiency with a different set of 
symbols- ZTRZ, ZTRS, IZ, IS. ZTRZ represents the node where the coefficient rtself 
is zero as well as all of its descendants. ZTRS represents the node where the 
coefficient itself is nonzero, but alll of its descendants are zero. IZ is a node where 
,he coefficient is zero but not all of its descendants are zero; and IS represents a 
significant coefficient whose descendants are not all zero. 

I„ comparison with the original symbols, the preferred embodiment replaces 
POS and NEG symbols with one symbol IS because the possibilfty for posrtrve 
numbers and negative numbers are about equal. By using one symbol, this reduces 



TI-26414AA Page 11 



the number of symbols used to reduce complexity and increase accuracy for 

probabilitgy estimation. „ , h = ttrs svmbol to make 

,„ addition, the preferred embodiment mtroduces the ZTFS sym o 

1 8peeWm of the process is a decaying 

autoregressive process, the frequenc p ^ ^ ^ 

function with respect to frequency. The rate of decay 
also decay with respect to scales. Since in zeroiree q 

using the new symbol set. 

Cor«°er^^ 
So^^c^^ 
Arithmetic coding is Known to be able to optimaily encode a stationay random 
Znce if the statistics of the random signa, can be estimated accurately, in 
pX.artthmeticcodingcan provide very good coding performance for smai, 
symboi sets, which is the case in the preferred embodiment. 

The statistics are estimated with accumuiative frequencies. Forgettmg 
factors are used to adjust the adaptation window size of the frequency estima - 
Tfor etting factor aliows the arithmetic codes to adiust to the iocal sta st,ca,l. 
However^ to^ small an adaptation window will fluctuate the statistics too ^ U 
wlUrndegradesthe performance. ,n the preferred embodiment, choosethe 
forgetting factorto be 127, which empirically gives the best results. 
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es Je^pl^ais.^onof.es^s. The context is de,erm,ned by 
two factors: 

(1 ) the state of the coefficient in the previous pass 

(2) the subband that the coefficient is in 

instance If a coefficient is a descendant of a zero zerotree root (ZTRZ) ft. 
Ts ass then Ks probata of being zero in the current pass ,s s,gn,f,can% 

X ^ F lure 4b iiiustrates the state —n graph for a coefficient from a 

Z to the current pass. The additional symbols DZ, DS are for ,n.ernal 

Porinstance for the highest subband, there will be no zerotree roots. When 

7TR7 ZTRS to be zero, because they will not appear m that subband. 

^^rdine te^ 

, nn rnllld be used for the subordinate pass. In the preferred 

respectively. 

, Ts Transform and S+P Transform instead of the foregoing floafng point 
It extended to lossless coding. The bKstream is —sly scalable from 
compression performance for the USC image date set us,ng the S + P Transform. 
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Image compression (bit/pixel) 
4.23 
4.83 
4.08 
5.86 

man 4.59 
woman 1 4.73 
woman2 3 30 

average 452 



Lena 
couple 
crowd 
lax 



Codingsimpjejmages 

L„ outers o, a*e intensity ieveis. For example, many intense levels are 
rerusedw^intheimage/su^age. Cai, such images mages. 

11 bilevei images; computer genreated graphics * simple textures, e, 

Ce,era Because wavelet transform is no. very effective in compression of simple 
imag es an image domain compression method can be employed as an atom*. 

option CREW achieved significant gains in cmpression performance for simple 
images, and for compound images conttnaing simple subimages. 

The CREW algorithm was proposed in the context of lossless or near 
,ossles S compression; however, even in a lossy comporessfcn environment these 
^pes of approaches are useful. In parTicuar,, when compound documents are 
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considered efficient performance can be achieved even wtth lossiess or near 
lossless compression of the image regions. 

The preferred embodiment simple image compression method proceeds as 
follows. Presume a segmentation procedure has been used to divide an input 
imag e into simple and natural image regions. The compression method for he 
simple regions processes each bttplane successively, from most sign,ficant to least 
signfficant bits; this achieves an embedded bitstream. A binary an.hme.ic coder 
(e o USP 4,122,440) is used to to compress each bftplane. The condit,onal 
probability mode, of the arithmetic coder is instrumental for good compression 
performance, and it is described in the following. 

Given an input bit to the bina^ arithmetic coder in the currant brtplane, the 
context model is composed of two sets of bits: one from neighboring Ms in the 
same bitplane-say ft. and the other from the bb in the other bitpianes-say C_ 
,„ order to not increase overt-ead to estate the conditional probabilrty of the 
current symbol, use a causal context, i.e., b*s from the causal past, which have 
already been processed by the arithmetic coder and thus known to the encoder and 
the decoder. 

The reason to consider two dflerent sets of context bits ,n clean first, the 
strong spatial correlation among image pixels also extends to some extent to ,ts 
bitplanes and thus using ft.* be beneficial. Second, ft. is useful because 
because the images as simple images. Consider as an example a bilevel image 
scanned as an 8 bit image. In this case the two most My M. are going to be 
0000 0000 and 1111 1111 and possibly levels which differ only in some of the least 
significant bits. Thus If a oartiucarl pixel's MSB is 1 , with high likelihood the 
remaining bits will also be 1 . This motivates the fact that for simple images ga,n can 
be achieved by using information about the previously scanned bitplanes. Th,s 
iustifies the potential benefits of C„. Note that a similar inturtion applies in the case 
where the original document scanned to 8 H images are not bilevel. Because ,n 
.hose simple images the number of grayscale levels is much less than the maximum 
256 for each given pattern of, say, 4 most significant bits, the number of poss,ble 
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cognations o, «ha 4 leas, signtican. b«s is likely * be iess than 16 and m ay indeed 
be biased ,o one particular ievei. This can be exploded through context mdel,ng. 
Note that gains can aiso be achieved for simpie images by performing histogram 
companion (e.g., if 32 leveis are used in an 8 bit image, mapping first hose 32 
,ave,s into 5 btts code, and then doing bitpiane for each of those 5 bitpianes). 

Once the context model has been selected, the compression method 
processes each bKplane in the usual raster scan order. Thie only necessary side 
information is the dimension of the input image, and the creation map wh,ch 
describes the location of simple and natural regions. 

The preferred embodiment uses 4 bits in C» the Ms adjacent to the current 
one in the W, N, NW, and NE directions. When confessing the k-th Mplane 
,K-0 7) use k bits from the 0-th to (k-1>* bitpianes in C where bitpiane 0 
corresponds to the MSB and tftpta. 7 to the LSB. This can be regard as a very 

neighborhood in C intra . 

This method can be compared to JBIG compression independently applied 
bitpiane by bitpiane. Experimental results indicate that the preferred embodiment 
outperforms the JBIG scheme for simpler images. 

The test images are obtained by preprocessing their correspond^ JPEG 
2000 test images. The preprocessing used is needed to replace 64x64 tiles in the 
original image with a black tile (or all zero tile) If the tile histogram 
than the prespecKced number of levels. For instance, cmpndl .idp ,s obta.nedwth a 
threshold of 16 while the other two images with 64. The results: 



image 


orignal size 


pref.emb. 


JBIG 


cmpndl. idp 


393216 


3370 


16056 


us.idp 


262144 


21150 


24011 


target.idp 


229376 


11468 


9739 
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Histogram compression can be used to preprocess simple images to help the 
performnace of the preferred embodiment method. If the number of used inters,* 

use and represent each pixel intensity with a reduced number of ,bts. 

using only 5 ibts per pixel after specifying those 29 levels explicitly. This makes the 
preferred embdoiment method avoid compressing 3 Upton* so as to increase 
expression efficiency. However, when to use histogram compaction must be 
Lided carefully, taking into accent the tradeoff between bits added by overhead 
and saved by removal of redundant bitplanes. 

Experiments indicate that histogram compaction can help com P ress,on of 
some simple images. Firs,, preprocess us.raw (512x448 pixeis) with a threshold o, 
24 to obtain us.idp (this contains only the simple regions in the original ,mage)^ 
When compressing this image without preceding histogram cm P act,on, y,elded a 
of 9161 bytes. When combined with hitogram compaction, only 6 bitplanes are 
necessary (or 42 intesnity levels are used for the whole image) and tahe 
compressed lie size is as small as 8373 bytes. 

H has to be noted that the preferred embdoimen, provides much o fhte gam 
achievable from histogram cmparction w«hou, some o, its drawbars. Spe« 
tnere is no need to send overhead btts and the mehtod adapts automat,cally to the 
occurrenceofdifferenthistogramsindifferentregionsoftheimage. 

While better resuls can be expected by using JPEG LS or CAL.C for the 
same images, the preferred embodiment is very simple and offers embedded 
bitstream and b* scalablilty. Thus provided there is a reasonable way to segment 
compund image (e.g., cmpndl .raw), this lossless compression method can be 
incorporated with an embedded wavelet coder for compound .mages, ,.e„ apply th,s 
method to me simpler area (e.g., text regions of cmpndl .raw) and wave,, coder to 
the natural image area (e.g., photo region of cmpndl. raw). 

The remaining issue is to determine effective ways o, separating s,m P le and 
natural image regions. This task is left to the encoder and appropriate overhead ,s 



TI-26414AA Page 17 



sen. to the decoder to indicate the classification, in our current version we use 
lo,si Z e64 x 64 hut constrain, the na, ra l regions to he rectangular™* 
scheme can he improved i, arbnrary region shapes are possible. One such 

each tile (with symmetric extensions at the edges). In this way the loss ,n 
lice forthe natural image regions wil, heminimal. in addition, g,ven, a, 

1 hitstreams use hitplane encoding the overhead a, the decoder . m,n,mal. 
^lythedecoderusesacon,e,adap,ivea ri *meticcoder,orhothreg^ 

and the main differences are: 

o No filtering is used for the simple image regions, 
o The context are different for each region type. 

For simple regions low spatial resolution (smaller) images have less value 
than for nature, images (for example, the textual information quicKly becomes 
illegiW e after downsampiing). However the encoder has the op„on of proving 
Lent types of scalability » so desired. In one scenario, the encoder can send the 
Tl image tiies in the order specified above, i.e., low frequency subbands are 

T rder of scan, one line at a time. Alternafively a low resolution represent f 
le tple image tiles can he embedded into the zero-tree structure. For example, 
^~**»*»*+*"~*-"»"+- Fromthisonecan 
extract 3 32 x 32 files which can be embedded within the higher frequency 
subbands (but transmitted using a different probability and context model)Jh,s 
allows complete intedeaving of the simple and natural image hitstreams. The 

frequency bands. Note that several methods are available to generate the 
image. For example, one could use simple downsampiing to generate the lower 
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spatial — tiles. Alternately, a pyramidal structure could be bui, In etther 
case, the same bitplane by bftplane emission method is preserved so that mo* 
secant b*s are sent prior to the least secant ibts, wKh the former be,ng sued 
as context for the latter. 
Experi mental results 

^^s^es the improvement in PSNR at various transm,ss,on b rates 
from using the preferred embodiment predictive zerotree (using ne« -symbols) over 
the standard zerotree wKh the baseband separately DPCM encoded and w,«h the 
following 9-3 tap Daubechies filter functions for the hierarchical decompose: 
h0 = [0 03314563036812, -0.06629126073624, -0.17677669529665, 
041984465132952,0.99436891104360,0.41984465132952, 

-0 17677669529665, -0.06629126073624, 0.03314563036812] 
hl = [0 35355339059327, 0.70710678118655, 0.35355339059327] 
Th e overall bitstream thus has an in«ia, block of bits fully encoding LL4, then a block 
of b*s encoding significant coefficient location using initial quantrzation thres olds, 
then a block of t» adding one bit accuracy to each of the signfflcant pocels, then a 
blockofbitsencodingnewly-signiflcantcoefficientlocationusingrefmed 

quantization thresholds, then a block of bits adding one bit accuracy to each of the 
significant coefficients (both with initial quantization thresholds and refined 
thresholds,, and so forth until target quanfization refinement or other bitstream s,ze 
or bitrate constraint occurs. 

Figure 6 compares the results using a single method versus using two drfferent 
me thods, one for the simple image portions and one another for the natural image 
portions, in these results reflect the straightforward approach of sending separately 
(i e without interleaving) the simple and natural tiles. 
Note that lossless compression is achieved at 2.62 bits per pixel when us.ng 
modified zerotree and at 1 .29 bit per pixel when using the combination of simple and 
natural, over a 40% gain. 



Modifications 
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The preferred embodiments may be varied in many ways while retaining one or 
more of their features of zerotree coding with a wildcard symbol used for 
replacement of significant coefficients. 

For example, the filters used for ,oss and lossless modes of operation could e 
different; the size of the images orframes, the number of decomposition levels, the 
initial thresholds, quantization levels, symbols, and so forth can be changed 
Generally, subband filtering of other types such as QMF and Johnston could be 
used in place of the wavelet filtering provided that the region-of-interest based 
approach is maintained. Images (data structures) with one or four or more 
dimensions can analogously be encoded by subband decomposition and modified 
zerotree coding applied.. 
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